Abstract

The probabilistic serial (PS) rule is a prominent randomized rule for
assigning indivisible goods to agents. Although it is well known for its
good fairness and welfare properties, it is not strategyproof. In view of
this, we address several fundamental questions regarding equilibria under
PS. Firstly, we show that Nash deviations under the PS rule can cycle.
Despite the possibility of cycles, we prove that a pure Nash equilibrium
is guaranteed to exist under the PS rule. We then show that verifying
whether a given profile is a pure Nash equilibrium is coNP-complete, and
computing a pure Nash equilibrium is NP-hard. For two agents, we present
a linear-time algorithm to compute a pure Nash equilibrium which yields
the same assignment as the truthful profile. Finally, we conduct
experiments to evaluate the quality of the equilibria that exist under
the PS rule, finding that the vast majority of pure Nash equilibria yield
social welfare that is at least that of the truthful profile.


1 Introduction 

Resource allocation is a fundamental and widely applicable area within AI
and computer science. When resource allocation rules are not strategyproof
and agents do not have an incentive to report their preferences truthfully,
it is important to understand the possible manipulations, the Nash
dynamics, and the existence and computation of equilibria.

In this paper we consider the probabilistic serial (PS) rule for the
assignment problem. In the assignment problem we have a possibly unequal
number of agents and objects where the agents express preferences over
objects and, based on these preferences, the objects are allocated to the
agents [Aziz et al., 2014; Bogomolnaia and Moulin, 2001; Gärdenfors, 1973;
Hylland and Zeckhauser, 1979]. The model is applicable to many resource
allocation and fair division settings where the objects may be public
houses, school seats, course enrollments, kidneys for transplant, car park
spaces, chores, joint assets, or time slots in schedules. The
probabilistic serial (PS) rule is a randomized (or fractional) assignment
rule. A randomized or fractional assignment rule takes the preferences of
the agents into account in order to allocate each agent a fraction of the
object. If the objects are indivisible but allocated in a randomized way,
the fraction can also be interpreted as the probability of receiving the
object. Randomization is widespread in resource allocation as it is a
natural way to ensure procedural fairness [Budish et al., 2013].


[...] rule [Bogomolnaia and Heo, 2012; Bogomolnaia and Moulin, 2001;
Budish et al., 2013; Katta and Sethuraman, 2006; Kojima, 2009;
Yilmaz, 2010; Saban and Sethuraman, 2014].


PS works as follows: each agent expresses a linear order over the set of
houses.¹ Each house is considered to have a divisible probability weight
of one. Agents simultaneously and at the same speed eat the probability
weight of their most preferred house that has not yet been completely
eaten. Once a house has been completely eaten by a subset of the agents,
each of these agents starts eating his next most preferred house that has
not been completely eaten (i.e., they may "join" other agents already
eating a different house or begin eating new houses). The procedure
terminates after all the houses have been completely eaten. The random
allocation of an agent by PS is the amount of each object he has eaten.
Although PS was originally defined for the setting where the number of
houses is equal to the number of agents, it can be used without any
modification for any number of houses relative to the number of agents
[Bogomolnaia and Moulin, 2001; Kojima, 2009].


In order to compare random allocations, an agent needs to consider
relations between them. We consider two well-known relations between
random allocations [Schulman and Vazirani, 2012; Saban and Sethuraman,
2014; Cho, 2012]: (i) expected utility (EU), and (ii) downward
lexicographic (DL). For EU, an agent prefers an allocation that yields
more expected utility. For DL, an agent prefers an allocation that gives
a higher probability to the most preferred alternative that has different
probabilities in the two allocations. Throughout the paper, we assume that
agents express strict preferences over houses, i.e., they are not
indifferent between any two houses.
The PS rule fares well in terms of fairness and welfare [Bogomolnaia and
Heo, 2012; Bogomolnaia and Moulin, 2001; Budish et al., 2013; Kojima,
2009; Yilmaz, 2010]. It satisfies strong envy-freeness and efficiency
with respect to the DL relation [Bogomolnaia and Moulin, 2001; Schulman
and Vazirani, 2012; Kojima, 2009]. Generalizations of the PS rule have
been recommended and applied in many settings [Aziz and Stursberg, 2014;
Budish et al., 2013]. The PS rule also satisfies some desirable incentive
properties: if the number of objects is at most the number of agents,
then PS is DL-strategyproof [Bogomolnaia and Moulin, 2001; Schulman and
Vazirani, 2012]. Another well-established rule, random serial dictator
(RSD), is not envy-free, not as efficient as PS [Bogomolnaia and Moulin,
2001], and the fractional allocations under RSD are #P-complete to
compute [Aziz et al., 2013].

¹We use the term house throughout the paper, though we stress that any
object could be allocated with these mechanisms.

Although PS performs well in terms of fairness and welfare, unlike RSD,
it is not strategyproof. Aziz et al. [2015a] showed that, in the scenario
where one agent is strategic, computing his best response (manipulation)
under complete information of the other agents' strategies is NP-hard for
the EU relation, but polynomial-time computable for the DL relation. In
this paper, we consider the situation where all agents are strategic. We
especially focus on pure Nash equilibria (PNE), i.e., reported preference
profiles for which no agent has an incentive to report a different
preference. We examine the following natural questions for the first
time: (i) What is the nature of best response dynamics under the PS rule?
(ii) Is a (pure) Nash equilibrium always guaranteed to exist? (iii) How
efficiently can a (pure) Nash equilibrium be computed? (iv) What is the
difference in quality of the various equilibria that are possible under
the PS rule? In related work, Ekici and Kesten [2012] showed that when
agents are not truthful, the outcome of PS may not satisfy desirable
properties related to efficiency and envy-freeness. Heo and Manjunath
[2012] provided a necessary and sufficient condition for implementability
of Nash equilibrium for the random assignment problem.


Contributions. For the PS rule we show that expected utility best
responses can cycle for any cardinal utilities consistent with the
ordinal preferences. This is significant as Nash dynamics in matching
theory has been an active area of research, especially for the stable
matching problem [Ackermann et al., 2011], and the presence of a cycle
means that following a sequence of best responses is not guaranteed to
result in an equilibrium profile. We then prove that a pure Nash
equilibrium (PNE) is guaranteed to exist for any number of agents and
houses and any utilities. To the best of our knowledge, this is the first
proof of the existence of a Nash equilibrium for the PS rule. For the
case of two agents we present a linear-time algorithm to compute a
preference profile that is in PNE with respect to the original
preferences. We show that the general problem of computing a PNE is
NP-hard. Finally, we run a set of experiments on real and synthetic
preference data to evaluate the welfare achieved by PNE profiles compared
to the welfare achieved under the truthful profile.


2 Preliminaries 


An assignment problem $(N, H, \succsim)$ consists of a set of agents
$N = \{1, \ldots, n\}$, a set of houses $H = \{h_1, \ldots, h_m\}$ and a
preference profile $\succsim = (\succsim_1, \ldots, \succsim_n)$ in which
$\succsim_i$ denotes a complete, transitive and strict ordering on $H$
representing the preferences of agent $i$ over the houses in $H$. A
fractional assignment is an $(n \times m)$ matrix
$[p(i)(h_j)]_{1 \le i \le n, 1 \le j \le m}$ such that for all $i \in N$
and $h_j \in H$, $0 \le p(i)(h_j) \le 1$; and for all
$j \in \{1, \ldots, m\}$, $\sum_{i \in N} p(i)(h_j) = 1$. The value
$p(i)(h_j)$ is the fraction of house $h_j$ that agent $i$ gets. Each row
$p(i) = (p(i)(h_1), \ldots, p(i)(h_m))$ represents the allocation of
agent $i$. A fractional assignment can also be interpreted as a random
assignment where $p(i)(h_j)$ is the probability of agent $i$ getting
house $h_j$.

Given two random assignments $p$ and $q$, $p(i) \succ_i^{DL} q(i)$, i.e.,
a player $i$ DL (downward lexicographic) prefers allocation $p(i)$ to
$q(i)$, if $p(i) \neq q(i)$ and for the most preferred house $h$ such
that $p(i)(h) \neq q(i)(h)$, we have that $p(i)(h) > q(i)(h)$. When
agents are considered to have cardinal utilities for the objects, we
denote by $u_i(h)$ the utility that agent $i$ gets from house $h$. We
will assume that the total utility of an agent equals the sum of the
utilities that he gets from each of the houses. Given two random
assignments $p$ and $q$, $p(i) \succ_i^{EU} q(i)$, i.e., a player $i$ EU
(expected utility) prefers allocation $p(i)$ to $q(i)$, if
$\sum_{h \in H} u_i(h) \cdot p(i)(h) > \sum_{h \in H} u_i(h) \cdot q(i)(h)$.

Since for all $i \in N$, agent $i$ compares assignment $p$ with
assignment $q$ only with respect to his allocations $p(i)$ and $q(i)$, we
will sometimes abuse the notation and use $p \succ_i^{EU} q$ for
$p(i) \succ_i^{EU} q(i)$.

A random assignment rule takes as input an assignment problem
$(N, H, \succsim)$ and returns a random assignment which specifies what
fraction or probability of each house is allocated to each agent. We will
primarily focus on the expected utility setting but will comment on and
use DL wherever needed.

The Probabilistic Serial Rule and Equilibria. The Probabilistic Serial
(PS) rule is a random assignment algorithm in which we consider each
house as infinitely divisible. At each point in time, each agent is
eating (consuming the probability mass of) his most preferred house that
has not been completely eaten. Each agent eats at the same unit speed.
Hence all the houses are eaten by time $m/n$ and each agent receives a
total of $m/n$ units of houses. The probability of house $h_j$ being
allocated to $i$ is the fraction of house $h_j$ that $i$ has eaten. The
PS fractional assignment can be computed in time $O(mn)$. We refer the
reader to Bogomolnaia and Moulin [2001] or Kojima [2009] for alternative
definitions of PS. The following example from [Bogomolnaia and Moulin,
2001; Aziz et al., 2015a] shows how PS works.

Example 1 (PS rule). Consider an assignment problem with the following
preference profile.

$\succsim_1$: $h_1, h_2, h_3$
$\succsim_2$: $h_2, h_1, h_3$
$\succsim_3$: $h_2, h_3, h_1$

Agents 2 and 3 start eating $h_2$ simultaneously whereas agent 1 eats
$h_1$. When 2 and 3 finish $h_2$, agent 1 has only eaten half of $h_1$.
The timing of the eating is as follows.

Agent 1: $h_1$ during $[0, 3/4]$, $h_3$ during $[3/4, 1]$
Agent 2: $h_2$ during $[0, 1/2]$, $h_1$ during $[1/2, 3/4]$, $h_3$ during $[3/4, 1]$
Agent 3: $h_2$ during $[0, 1/2]$, $h_3$ during $[1/2, 1]$

The final allocation computed by PS is

$$PS(\succsim_1, \succsim_2, \succsim_3) = \begin{pmatrix} 3/4 & 0 & 1/4 \\ 1/4 & 1/2 & 1/4 \\ 0 & 1/2 & 1/2 \end{pmatrix}.$$

Consider the assignment problem in Example 1. If agent 1 misreports his
preferences as $\succsim_1'$: $h_2, h_1, h_3$, then

$$PS(\succsim_1', \succsim_2, \succsim_3) = \begin{pmatrix} 1/2 & 1/3 & 1/6 \\ 1/2 & 1/3 & 1/6 \\ 0 & 1/3 & 2/3 \end{pmatrix}.$$

If we suppose that $u_1(h_1) = 7$, $u_1(h_2) = 6$, and $u_1(h_3) = 0$,
then agent 1 gets more expected utility when he reports $\succsim_1'$.
In the example, the truthful profile is in PNE with respect to
DL preferences but not expected utility.
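The eating procedure can be simulated exactly with rational arithmetic. The following is a minimal sketch, not the authors' implementation; the function name `probabilistic_serial` and the string labels used for houses are illustrative assumptions. It recomputes both allocations of Example 1.

```python
from fractions import Fraction


def probabilistic_serial(prefs, houses):
    """Simulate the PS rule.

    prefs[i] is agent i's reported strict ranking of all houses,
    most preferred first. Returns one {house: fraction} dict per agent.
    """
    remaining = {h: Fraction(1) for h in houses}
    alloc = [{h: Fraction(0) for h in houses} for _ in prefs]
    while any(r > 0 for r in remaining.values()):
        # Each agent eats his most preferred house that is not yet finished.
        target = [next(h for h in p if remaining[h] > 0) for p in prefs]
        eaters = {h: target.count(h) for h in set(target)}
        # Advance time until the first of the targeted houses is finished.
        dt = min(remaining[h] / k for h, k in eaters.items())
        for i, h in enumerate(target):
            alloc[i][h] += dt
            remaining[h] -= dt
    return alloc


HOUSES = ["h1", "h2", "h3"]
truthful = [["h1", "h2", "h3"], ["h2", "h1", "h3"], ["h2", "h3", "h1"]]
misreport = [["h2", "h1", "h3"]] + truthful[1:]  # agent 1 deviates

for profile in (truthful, misreport):
    for row in probabilistic_serial(profile, HOUSES):
        print([str(row[h]) for h in HOUSES])
```

Using `Fraction` rather than floats keeps the finishing times exact, so the comparison `remaining[h] > 0` never suffers from rounding.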

We study the existence and computation of Nash equilibria. For a
preference profile $\succsim$, we denote by $(\succsim_{-i}, \succsim_i')$
the preference profile obtained from $\succsim$ by replacing agent $i$'s
preference by $\succsim_i'$.
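The two comparison relations from the preliminaries can be illustrated on agent 1's allocations in Example 1. This is a small sketch under the assumption that allocations are stored as dictionaries keyed by house label; the helper names `dl_prefers` and `eu_prefers` are hypothetical.

```python
from fractions import Fraction as F


def dl_prefers(p, q, ranking):
    """True if allocation p is strictly DL-preferred to q under `ranking`."""
    for h in ranking:  # most preferred house first
        if p[h] != q[h]:
            return p[h] > q[h]
    return False  # identical allocations: no strict preference


def eu_prefers(p, q, utility):
    """True if p yields strictly more expected utility than q."""
    def eu(x):
        return sum(utility[h] * x[h] for h in utility)
    return eu(p) > eu(q)


ranking = ["h1", "h2", "h3"]  # agent 1's true preferences
truthful = {"h1": F(3, 4), "h2": F(0), "h3": F(1, 4)}
deviated = {"h1": F(1, 2), "h2": F(1, 3), "h3": F(1, 6)}
utility = {"h1": 7, "h2": 6, "h3": 0}

print(dl_prefers(truthful, deviated, ranking))  # True: 3/4 > 1/2 on h1
print(eu_prefers(deviated, truthful, utility))  # True: 11/2 > 21/4
```

The two relations disagree here, which is why the truthful report of agent 1 is a DL best response in the example but not an EU best response for these utilities.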


3 Nash Dynamics 

When considering Nash equilibria of any setting, one of the most natural
ways of proving that a PNE always exists is to show that better or best
responses do not cycle, which implies that eventually, Nash dynamics
terminate at a Nash equilibrium profile. Our first result is that DL and
EU best responses can cycle. For EU best responses, this is even the case
when agents have Borda utilities.

Theorem 1. With 2 agents and 5 houses where agents have Borda utilities,
EU best responses can lead to a cycle in the profile.

Proof. The following 5 step sequence of best responses leads to a cycle.
We use $U$ to denote the matrix of utilities of the agents over the
houses such that $U[1,1]$ is the utility of agent 1 for house $h_1$. Note
that the profile starts as the truthful reporting in our example. The
initial preferences and utilities of the agents are:

$\succ_1$: $h_2, h_3, h_5, h_4, h_1$
$\succ_2$: $h_5, h_3, h_4, h_1, h_2$

$$U = \begin{pmatrix} 0 & 4 & 3 & 1 & 2 \\ 1 & 0 & 3 & 2 & 4 \end{pmatrix}.$$

This yields the following allocation and utilities at the start:

$$PS(\succ_1, \succ_2) = \begin{pmatrix} 1/2 & 1 & 1/2 & 1/2 & 0 \\ 1/2 & 0 & 1/2 & 1/2 & 1 \end{pmatrix}, \quad EU_0 = \begin{pmatrix} 6 \\ 7 \end{pmatrix}.$$

In Step 1, agent 1 deviates to increase his utility. He reports the
preference $\succ_1'$: $h_3, h_4, h_2, h_1, h_5$, which results in

$$PS(\succ_1', \succ_2) = \begin{pmatrix} 0 & 1 & 1 & 1/2 & 0 \\ 1 & 0 & 0 & 1/2 & 1 \end{pmatrix}, \quad EU_1 = \begin{pmatrix} 7.5 \\ 6 \end{pmatrix}.$$

In Step 2, agent 2 changes his report to
$\succ_2'$: $h_3, h_4, h_5, h_1, h_2$. This increases his utility to 7
and decreases the utility of agent 1 to 6.

In Step 3, agent 1 changes his report to
$\succ_1''$: $h_3, h_5, h_2, h_1, h_4$. This increases the utility of
agent 1 to 7.5 and decreases the utility of agent 2 to 4.5.

In Step 4, agent 2 changes his report to
$\succ_2''$: $h_5, h_3, h_4, h_1, h_2$, which increases his expected
utility to 6.5 while decreasing the expected utility of agent 1 to 7.

In Step 5, agent 1 changes his report to
$\succ_1'''$: $h_3, h_4, h_2, h_1, h_5$. Notice that
$\succ_1''' = \succ_1'$ and $\succ_2'' = \succ_2$. This is the same
profile as the one of Step 1, so we have cycled. □
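The expected utilities quoted in the proof can be recomputed mechanically. The sketch below is an illustration only: it assumes houses are labelled by the integers 1 to 5, uses a hypothetical helper `probabilistic_serial` for the eating procedure, and checks the utilities of each step rather than the best-response property itself.

```python
from fractions import Fraction


def probabilistic_serial(prefs, houses):
    """PS rule: prefs[i] ranks all houses, most preferred first."""
    remaining = {h: Fraction(1) for h in houses}
    alloc = [{h: Fraction(0) for h in houses} for _ in prefs]
    while any(r > 0 for r in remaining.values()):
        target = [next(h for h in p if remaining[h] > 0) for p in prefs]
        eaters = {h: target.count(h) for h in set(target)}
        dt = min(remaining[h] / k for h, k in eaters.items())
        for i, h in enumerate(target):
            alloc[i][h] += dt
            remaining[h] -= dt
    return alloc


def expected_utilities(profile, utils, houses):
    """Expected utility of every agent under the reported profile."""
    alloc = probabilistic_serial(profile, houses)
    return tuple(sum(u[h] * row[h] for h in houses)
                 for u, row in zip(utils, alloc))


HOUSES = [1, 2, 3, 4, 5]
TRUE = ([2, 3, 5, 4, 1], [5, 3, 4, 1, 2])
# Borda utilities: m-1 for the top house down to 0 for the last one.
BORDA = [{h: len(p) - 1 - r for r, h in enumerate(p)} for p in TRUE]

REPORTS = [
    TRUE,                                 # truthful start
    ([3, 4, 2, 1, 5], TRUE[1]),           # Step 1: agent 1 deviates
    ([3, 4, 2, 1, 5], [3, 4, 5, 1, 2]),   # Step 2: agent 2 deviates
    ([3, 5, 2, 1, 4], [3, 4, 5, 1, 2]),   # Step 3: agent 1 deviates
    ([3, 5, 2, 1, 4], TRUE[1]),           # Step 4: agent 2 deviates
    ([3, 4, 2, 1, 5], TRUE[1]),           # Step 5: agent 1 deviates
]

for step, profile in enumerate(REPORTS):
    eu = expected_utilities(profile, BORDA, HOUSES)
    print(step, [float(x) for x in eu])
print("cycle:", REPORTS[5] == REPORTS[1])
```

Each deviating agent strictly gains at his step, and the profile after Step 5 coincides with the profile after Step 1.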


It can be verified that every response in this example is both an EU best
response (with respect to any cardinal utilities consistent with the
ordinal preferences) and also a DL best response. Hence, DL best
responses and EU best responses (with respect to any cardinal utilities
consistent with the ordinal preferences) can cycle.

The fact that best responses can cycle means that simply following best
responses need not result in a PNE. Hence the normal form game induced by
the PS rule is not a potential game [Monderer and Shapley, 1996].
Checking whether an instance has a Nash equilibrium appears to be a
challenging problem. The naive method requires going through $O(m!^n)$
profiles, which is super-polynomial even when $n = O(1)$ or $m = O(1)$.
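To make the size of the naive search concrete, the sketch below counts the reported profiles and spells out the brute-force equilibrium test for a single profile. It is a generic illustration: `payoff` is a hypothetical caller-supplied function (for PS it would run the rule and evaluate the agent's utility), and profiles are assumed to be tuples of tuples.

```python
from itertools import permutations
from math import factorial


def num_profiles(n_agents, n_houses):
    """Number of reported profiles: each agent picks one of m! rankings."""
    return factorial(n_houses) ** n_agents


def is_pure_nash(profile, payoff):
    """Naive PNE check: try every unilateral misreport of every agent.

    `payoff(i, profile)` is a caller-supplied (hypothetical) function
    returning agent i's utility under the rule being studied.
    """
    houses = profile[0]
    for i in range(len(profile)):
        current = payoff(i, profile)
        for report in permutations(houses):
            deviated = profile[:i] + (report,) + profile[i + 1:]
            if payoff(i, deviated) > current:
                return False  # a profitable Nash deviation exists
    return True


print(num_profiles(4, 4))  # 24**4 = 331776
```

Checking one profile already takes $n \cdot m!$ payoff evaluations, and enumerating all equilibria repeats this for each of the $m!^n$ profiles.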


4 Existence of Pure Nash Equilibria 

Although it seems that computing a Nash equilibrium is a challenging
problem (we give hardness results in the next section), we show that at
least one (pure) Nash equilibrium is guaranteed to exist for any number
of houses, any number of agents, and any preference relation over
fractional allocations.² The proof relies on showing that the PS rule can
be modelled as a perfect information extensive form game.

Theorem 2. A PNE is guaranteed to exist under the PS rule 
for any number of agents and houses, and for any relation 
between allocations. 

Proof sketch. Consider running PS on all possible $m!^n$ preference
profiles for $n$ agents and $m$ objects. In each profile $i$, let
$t_i^1, \ldots, t_i^{k_i}$ be the $k_i$ different time points in the PS
algorithm run for the $i$-th profile when at least one house is finished,
and let $t_i^0 = 0$. Let
$g = GCD(\{t_i^{j+1} - t_i^j : j \in \{0, \ldots, k_i - 1\}, i \in \{1, \ldots, m!^n\}\})$
where GCD denotes the greatest common divisor. Since in each profile $i$,
$t_i^{j+1} - t_i^j > 0$ for all $j \in \{0, \ldots, k_i - 1\}$, we have
that $g$ is finite and greater than zero.

The time interval length $g$ is small enough such that each run of the PS
rule can be considered to have $m/g$ stages of duration $g$. Each stage
can be viewed as having $n$ sub-stages so that in each stage, agent $i$
eats $g/n$ units of a house in sub-stage $i$ of a stage. In each
sub-stage only one agent eats $g/n$ units of the most favoured house that
is available. Hence we now view PS as consisting of a total of $mn/g$
sub-stages and the agents keep coming in order $1, 2, \ldots, n$ to eat
$g/n$ units of the most preferred house that is still available. If an
agent eats $g/n$ units of a house in a stage then he will eat $g/n$ units
of the same house in his sub-stage of the next stage as long as the house
has not been fully eaten. Consider a perfect information extensive form
game tree. For a fixed reported preference profile, the PS rule unravels
accordingly along a path starting at the root and ending at a leaf. Each
level of the tree represents a sub-stage in which a certain agent has his
turn to eat $g/n$ units of his most preferred available house. Note that
there is a one-to-one correspondence between the paths in the tree and
the ways the PS algorithm can be implemented, depending on the reported
preference.

A subgame perfect Nash equilibrium (SPNE) is guaranteed to exist for such
a game via backward induction: starting from the leaves and moving
towards the root of the tree, the agent at the specific node chooses an
action that maximizes his utility given the actions determined for the
children of the node. The SPNE identifies at least one such path from a
leaf to the root of the game. The path can be used to read out the most
preferred house of each agent at each point. The information provided is
sufficient to construct a preference profile that is in Nash equilibrium.
Those houses that an agent did not eat at all can conveniently be placed
at the end of the preference list. Such a preference profile is in Nash
equilibrium. □

²We already know from Nash's original result that a mixed Nash
equilibrium exists for any game.
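The stage length $g$ in the proof sketch is a greatest common divisor of rational numbers. The following sketch shows one way to compute it from the finishing times of a run, using the two runs of Example 1 as input (the proof takes the GCD over all $m!^n$ profiles, not just two). The helper names `fraction_gcd` and `stage_length` are assumptions made for illustration.

```python
from fractions import Fraction as F
from functools import reduce
from math import gcd


def fraction_gcd(a, b):
    """Largest rational g such that a/g and b/g are both integers."""
    return F(gcd(a.numerator * b.denominator, b.numerator * a.denominator),
             a.denominator * b.denominator)


def stage_length(finish_times):
    """GCD of the gaps between consecutive finishing times, starting at 0."""
    times = [F(0)] + sorted(finish_times)
    gaps = [t1 - t0 for t0, t1 in zip(times, times[1:])]
    return reduce(fraction_gcd, gaps)


# Finishing times of houses in Example 1.
truthful_run = [F(1, 2), F(3, 4), F(1)]    # h2, h1, h3 under the truthful profile
misreport_run = [F(1, 3), F(5, 6), F(1)]   # h2, h1, h3 when agent 1 misreports

g1 = stage_length(truthful_run)
g2 = stage_length(misreport_run)
print(g1, g2, fraction_gcd(g1, g2))
```

Every finishing time in both runs is an integer multiple of the combined value, which is the property the discretization into stages relies on.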


5 Complexity of Pure Nash Equilibrium 

Our argument for the existence of a Nash equilibrium is constructive.
However, naively constructing the extensive form game and then computing
a subgame perfect Nash equilibrium requires exponential space and time.
It is unclear whether a subgame perfect Nash equilibrium or any Nash
equilibrium preference profile can be computed in polynomial time.


5.1 General Complexity Results 

In this section, we show that computing a PNE is NP-hard and verifying
whether a profile is a PNE is coNP-complete. Recently it was shown that
computing an expected utility best response is NP-hard [Aziz et al.,
2015b; Aziz et al., 2015a]. Since equilibria and best responses are
somewhat similar, one would expect that problems related to equilibria
under PS are also hard. However, there is no general reduction from best
response to equilibria computation or verification. In view of this, we
prove results regarding PNE by closely analyzing the reduction in [Aziz
et al., 2015b]. First, we show that checking whether a given preference
profile is in PNE under the PS rule is coNP-complete.

Theorem 3. Given agents' utilities, checking whether a given preference
profile is in PNE under the PS rule is coNP-complete.

Proof sketch. Consider the reduction from 3SAT to an assignment setting
from [Aziz et al., 2015b; Aziz et al., 2015a]. We show that checking
whether the truthful preference profile is in PNE is coNP-complete. The
problem is in coNP, since a Nash deviation is a polynomial-time checkable
No-certificate. The original reduction considers one manipulator (agent
1) while the other agents $N \setminus \{1\}$ are 'non-manipulators'. In
the original reduction, the utility functions of agents in
$N' = N \setminus \{1\}$ are not specified. We specify the utility
function of agents in $N'$ as follows: the utility of an agent in $N'$
for his $j$-th most preferred house is [...], where $n = |N|$ and $m$ is
the number of houses. These utility functions can be represented in space
that is polynomial in $O(n + m)$. We rely on 2 main observations about
the original reduction. First, in the truthful profile, whenever an agent
finishes eating a house all houses have either been fully allocated or
are only at most half eaten. Second, in the truthful profile every house
except the prize house (the last house that is eaten) is eaten by at
least 2 agents. We now show that due to the utility function constructed,
each agent from $N'$ is compelled to report truthfully. Assume for
contradiction that this is not the case, and let us consider the earliest
house (when running the PS rule) that some agent $i \in N'$ starts to eat
although he prefers another available house $h$. Let $k$ denote the
number of agents who eat a fraction of $h$ under the truthful profile. By
reporting truthfully, we show that agent $i$ can get
$\frac{1}{2n} - \frac{1}{2nk} \ge \frac{1}{4n}$ more of $h$ than by
delaying eating $h$. Let us consider how much additional fraction of $h$
agent $i$ can consume by reporting truthfully. If he reports truthfully,
he can start eating $h$ earlier and, in the worst case, he can only start
$1/2n$ time units earlier. This means that $h$ is consumed earlier by a
time of $1/2n$ if $i$ reports truthfully. Consider the time interval of
length $1/2n$ between the time when $h$ is finished when $i$ is truthful
about $h$ and the time $h$ is finished when $i$ delays eating $h$. In
this last stretch of time interval $1/2n$, $i$ gets $\frac{1}{2nk}$ of
$h$ extra when he does not report truthfully. Hence by reporting
truthfully, $i$ gets at least $\frac{1}{2n} - \frac{1}{2nk}$ more of $h$,
which is at least $1/4n$ since $k \ge 2$. Due to the utilities
constructed, even if $i$ gets all the less preferred houses, he cannot
make up for the loss in utility for getting only $1/4n$ of $h$.

Now that we have established that the agents in $N'$ report truthfully in
a PNE, it follows that the truthful preference profile is in PNE iff the
manipulator's truthful report is his best EU response. Assuming that the
agents in $N \setminus \{1\}$ report truthfully, checking whether the
truthful preference is agent 1's best response was shown to be NP-hard.
We have shown that the agents in $N'$ report truthfully in a PNE. Hence
checking whether the truthful profile is in PNE is coNP-hard. □
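The lower bound of $1/4n$ used in the proof sketch follows from a one-line calculation. Assuming, as in the argument above, that truthful reporting gains agent $i$ a head start worth $\frac{1}{2n}$ of $h$ and forgoes at most a $\frac{1}{k}$ share of the final stretch of length $\frac{1}{2n}$, the net gain can be written out as:

```latex
\[
  \frac{1}{2n} - \frac{1}{k}\cdot\frac{1}{2n}
  \;=\; \frac{k-1}{2kn}
  \;\ge\; \frac{1}{4n}
  \quad\Longleftrightarrow\quad
  2(k-1) \ge k
  \quad\Longleftrightarrow\quad
  k \ge 2 .
\]
```

The condition $k \ge 2$ is exactly the second observation about the reduction: every house other than the prize house is eaten by at least 2 agents.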

Next, we show that computing a PNE with respect to the 
underlying utilities of the agents is NP-hard. 

Theorem 4. Given agents' utilities, computing a preference profile that
is in PNE under the PS rule is NP-hard.


Proof. The same argument as above shows that the agents in $N'$ play
truthfully in a PNE. Hence, a preference profile is in PNE iff agent 1
reports his EU best response. It has already been shown that computing
this EU best response is NP-hard [Aziz et al., 2015b] when the other
agents $N \setminus \{1\}$ report truthfully. Thus computing a PNE is
NP-hard. □


5.2 Case of Two Agents 

In this section, we consider the case of two agents since many disputes
involve two parties. Since an EU best response can be computed in linear
time for the case of two agents [Aziz et al., 2015b; Aziz et al., 2015a],
it follows that it can be verified whether a profile is a PNE in
polynomial time as well.

We can prove the following theorem for the "threat profile" whose
construction is shown in Algorithm 1.























Theorem 5. Under PS and for two agents, there exists a preference profile
that is in DL-Nash equilibrium and results in the same assignment as the
assignment based on the truthful preferences. Moreover, it can be
computed in linear time.

Proof. The proof is by induction over the length of the constructed
preference lists. The main idea of the proof is that if both agents
compete for the same house then they do not have an incentive to delay
eating it. If the most preferred houses do not coincide, then both agents
get them with probability one but will not get them completely if they
delay eating them. The algorithm is described as Algorithm 1.

We now prove that $Q_1$ is a DL best response against $Q_2$ and $Q_2$ is
a DL best response against $Q_1$. The proof is by induction over the
length of the preference lists. For the first elements in the preference
lists $Q_1$ and $Q_2$, if the elements coincide, then no agent has an
incentive to put the element later in the list since the element is both
agents' most preferred house. If the maximal elements do not coincide,
i.e., $h \neq h'$, then 1 and 2 get $h$ and $h'$ respectively with
probability one. However they still need to express these houses as their
most preferred houses because if they don't, they will not get the house
with probability one. The reason is that $h$ is the next most preferred
house after $h'$ for agent 2 and $h'$ is the next most preferred house
after $h$ for agent 1. Agent 1 has no incentive to change the position of
$h'$ since $h'$ is taken by agent 2 completely before agent 1 can eat it.
Similarly, agent 2 has no incentive to change the position of $h$ since
$h$ is taken by agent 1 completely before agent 2 can eat it. Now that
the positions of $h$ and $h'$ have been completely fixed, we do not need
to consider them and can use induction over $Q_1$ and $Q_2$ where $h$ and
$h'$ are deleted. □
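The construction of the threat profile in Algorithm 1 can be sketched in a few lines of Python. This is an illustrative transcription, with houses assumed to be integers and the function name `threat_profile` chosen here; for readability it deletes houses by rebuilding the lists, which is quadratic rather than linear time.

```python
def threat_profile(pref1, pref2):
    """Build the threat profile (Q1, Q2) from the true rankings of two agents.

    pref1 and pref2 rank the same houses, most preferred first.
    """
    p1, p2 = list(pref1), list(pref2)
    q1, q2 = [], []
    while p1 and p2:
        h, h_prime = p1[0], p2[0]
        q1.append(h)
        q2.append(h_prime)
        # Remove both top houses from both remaining lists.
        p1 = [x for x in p1 if x not in (h, h_prime)]
        p2 = [x for x in p2 if x not in (h, h_prime)]
        if h != h_prime:
            # Each agent threatens to eat the other's top house next.
            q1.append(h_prime)
            q2.append(h)
    return q1, q2


# True preferences from the proof of Theorem 1.
q1, q2 = threat_profile([2, 3, 5, 4, 1], [5, 3, 4, 1, 2])
print(q1)  # [2, 5, 3, 4, 1]
print(q2)  # [5, 2, 3, 4, 1]
```

In this instance agent 1 lists $h_5$ second and agent 2 lists $h_2$ second, so neither agent can delay his own top house without losing part of it to the other.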


The desirable aspect of the threat profile is that since it results in
the same assignment as the assignment based on the truthful preferences,
the resulting assignment satisfies all the desirable properties of the PS
outcome with respect to the original preferences. Since a DL best
response algorithm is also an EU best response algorithm for the case of
two agents [Aziz et al., 2015a], we get the following corollary.

Corollary 1. Under PS and for 2 agents, there exists a preference profile
that is in Nash equilibrium for any utilities consistent with the ordinal
preferences. Moreover it can be computed in linear time.

Input: $(\{1, 2\}, H, (\succsim_1, \succsim_2))$
Output: The "threat profile" $(Q_1, Q_2)$ where $Q_i$ is the preference
list of agent $i$ for $i \in \{1, 2\}$.

Let $P_i$ be the preference list of agent $i \in \{1, 2\}$
Initialise $Q_1$ and $Q_2$ to empty lists.
while $P_1$ and $P_2$ are not empty do
    Let $h = first(P_1)$ and $h' = first(P_2)$
    Append $h$ to $Q_1$; Append $h'$ to $Q_2$
    Delete $h$ and $h'$ from $P_1$ and $P_2$
    if $h \neq h'$ then
        Append $h'$ to $Q_1$; Append $h$ to $Q_2$
return $(Q_1, Q_2)$.

Algorithm 1: Threat profile DL-Nash equilibrium for 2 agents (which also
is an EU-Nash equilibrium) which provides the same allocation as the
truthful profile.


6 Experiments

We conducted a series of experiments to understand the number and quality
of equilibria that are possible under the PS rule. For quality, we use
the utilitarian social welfare (SW) function, i.e., the sum of the
utilities of the agents. We are limited by the large search space needed
to examine equilibria. For instance, for each set of cardinal preferences
we generate, we consider all misreports ($m!$) for all agents ($n$),
leaving us with a search space of size $m!^n$ for each of the samples for
each combination of parameters. Thus, we only report results for small
numbers of agents and houses in this section. We generated 300 samples
for each combination of preference model, number of agents, and number of
items, reporting the aggregate statistics for these experiments for 4
agents in Figures 1 and 2. Each individual sample with 4 agents and 4
houses took about 15 minutes to complete using one core on an Intel Xeon
E5405 CPU running at 2.0 GHz with 4 GB of RAM running Debian 2.6.32. The
results for 2 agents and up to 5 houses as well as 3 agents and up to 4
houses are similar.


We used a variety of common statistical models to generate data (see,
e.g., [Mattei, 2011; Mallows, 1957; Lu and Boutilier, 2011; Berg, 1985]):
the Impartial Culture (IC) model generates all preferences uniformly at
random; the Single Peaked Impartial Culture (SP-IC) generates all
preference profiles that are single peaked uniformly at random; Mallows
Models (Mallows) is a correlated preference model where the population is
distributed around a reference ranking proportional to the Kendall-Tau
distance; Polya-Eggenberger Urn Models (Urn) creates correlations between
the agents: once a preference order has been randomly selected, it is
subsequently selected with higher probability. In our experiments we set
the probability that the second order is equivalent to the first to 0.5.
We also used real world data from PrefLib [Mattei and Walsh, 2013]: AGH
Course Selection (ED-00009). This data consists of students bidding on
courses to attend in the next semester. We sampled students from this
data (with replacement) as the agents after we restricted the preference
profiles to a random set of houses of a specified size.

Figure 1: Classification of equilibria in settings consisting of four
agents with preferences drawn from the six models over 2 to 4 houses. We
can see that the vast majority of the equilibria that were found across
all samples had the same social welfare as the truthful profile. In
general, we see there are roughly the same number of equilibria that
increase or decrease social welfare.

Figure 2: (A) The maximum and minimum percentage increase or decrease in
social welfare over all 300 samples for each of the six models with 2 to
4 houses and four agents. We see that the gain of the best profile is, in
general, slightly more than the loss in the worst profile with respect to
the truthful profile. (B) The average number of the $4!^4 = 331{,}776$
profiles that are in equilibria per instance across all combinations of
parameters. The more uncorrelated models (IC and SP-IC) admit the highest
number of equilibria.

To compare the different allocations achieved under PS we need to give
each agent not only a preference order but also a utility for each house.
Formally we have, for all $i \in N$ and all $h_j \in H$, a value
$u_i(h_j) \in \mathbb{R}$. To generate these utilities we use what we
call the Random model: we uniformly at random generate a real number
between 0 and 1 for each house. We sort this list in strictly decreasing
order; if we cannot, we generate a new list (we discarded 0 lists in our
experiments). We normalize these utilities such that each agent's utility
sums to a constant value (here, the number of houses) that is the same
for all agents. In prior experiments we found the Random utility model to
be the most manipulable and to admit the worst equilibria. Therefore, we
only focus on this utility model here (over Borda or Exponential
utilities) as it represents, empirically, a worst case. We separate
equilibria into three categories: those where the SW is the same as in
the truthful profile, those where we have a decrease in SW, and those
where we have an increase in SW. Given the social welfare of two
different profiles, $SW_1$ and $SW_2$, we use percentage change to
understand the magnitude of this difference.
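The Random utility model described above can be sketched as follows. This is an illustration under stated assumptions, not the code used for the experiments: the function name `random_utilities` is hypothetical, and the returned list is ordered from the agent's most preferred house to his least preferred one.

```python
import random


def random_utilities(num_houses, rng):
    """Utilities for one agent, listed from most to least preferred house."""
    while True:
        draws = sorted((rng.random() for _ in range(num_houses)), reverse=True)
        # Redraw in the (rare) event of a tie, so that preferences are strict.
        if all(a > b for a, b in zip(draws, draws[1:])):
            break
    # Normalize so that every agent's utilities sum to the number of houses.
    scale = num_houses / sum(draws)
    return [u * scale for u in draws]


rng = random.Random(0)  # fixed seed only to make the sketch repeatable
for agent in range(4):
    print([round(u, 3) for u in random_utilities(4, rng)])
```

Normalizing every agent to the same total keeps the utilitarian social welfare from being dominated by whichever agent happened to draw large values.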

For all models, for all combinations of 2 to 4 agents and 2 to 5 houses
there are, generally, slightly more equilibria that increase social
welfare compared to the truthful profile than those that decrease it, as
illustrated in Figure 1. However, the vast majority of equilibria have
the same social welfare as the truthful profile, and the best equilibria
are, in general, slightly better than the worst equilibria, as
illustrated in Figure 2. Hence, if any or all of the agents manipulate,
there may be a loss of SW at equilibria, but there is also the potential
for large gains; and the most common outcome of all these agents being
strategic is that, dynamically, we will wind up in an equilibrium which
provides the same SW as the truthful one. Our main observations are: (i)
The vast majority of equilibria have social welfare equal to the social
welfare in the truthful profile. (ii) In general, the number of PNE that
have increased social welfare (with respect to the truthful profile) is
slightly more than the number of PNE that have decreased social welfare.
(iii) The maximum increase and decrease in SW in equilibria compared to
the truthful profile was observed to be under 23% and 18% respectively.
(iv) There are very few profiles that are in equilibria, overall.
Profiles with relatively high degrees of correlation between the
preferences (Urn and AGH 2004) have fewer equilibrium profiles than the
less correlated models (IC and SP-IC). (v) These trends appear stable
with small numbers of agents and houses. We observed similar results for
all combinations.

7 Conclusions 

We conducted a detailed analysis of strategic aspects of the PS rule
including the complexity of computing and verifying PNE. The fact that
PNE are computationally hard to compute in general may act as a
disincentive or barrier to strategic behavior. Our experimental results
show PS is relatively robust, in terms of social welfare, even in the
presence of strategic behavior. Our study leads to a number of new
research directions. It will be interesting to extend our algorithmic
results to the extension of PS for indifferences [Katta and Sethuraman,
2006]. Additionally, studying strong Nash equilibria and a deeper
analysis of Nash dynamics are other interesting directions.